An XML-Based Lightweight C++ Fact Extractor
Abstract
A lightweight fact extractor is presented that uses XML tools, such as XPath and XSLT, to extract static information from C++ source code. The source code is first converted into an XML representation, srcML, to facilitate the use of a wide variety of XML tools. The method is deemed lightweight because only a partial parsing of the source is done. Additionally, the technique is quite robust and can be applied to incomplete and non-compilable source code. The trade-off of this approach is that queries on some low-level details cannot be directly addressed. This approach is applied to a fact extractor benchmark as a comparison with other, albeit heavier-weight, fact extractors. Fact extractors are widely used to support understanding tasks associated with maintenance, reverse engineering, and various other software engineering tasks.
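The idea of querying an XML view of source code with XPath can be illustrated with a minimal sketch. The XML below is a hand-written, simplified stand-in for srcML output, not actual tool output: real srcML is namespaced and marks up considerably more detail. The function name `extract_facts` and the chosen fact categories are hypothetical and serve only as illustration, using the limited XPath subset supported by Python's standard library.

```python
import xml.etree.ElementTree as ET

# Hand-written, simplified stand-in for srcML output (assumption: real srcML
# is namespaced and marks up far more syntactic detail than shown here).
SRCML_LIKE = """
<unit language="C++">
  <class>class <name>Shape</name> {
    <function><type><name>double</name></type> <name>area</name>() { }</function>
  };</class>
  <function><type><name>int</name></type> <name>main</name>() {
    <call><name>area</name>()</call>;
  }</function>
</unit>
"""


def extract_facts(xml_text):
    """Collect a few static facts with XPath-style queries (hypothetical helper)."""
    root = ET.fromstring(xml_text)
    return {
        # Name elements that are direct children of a class element.
        "classes": [n.text for n in root.findall(".//class/name")],
        # Direct-child names of functions; names nested in <type> are skipped.
        "functions": [n.text for n in root.findall(".//function/name")],
        # Names of called functions.
        "calls": [n.text for n in root.findall(".//call/name")],
    }


facts = extract_facts(SRCML_LIKE)
print(facts)
```

Because the queries match only the markup they name, unrecognized or incomplete code elsewhere in the file does not block extraction, which is in keeping with the robustness the abstract describes.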
Similar resources
The CLaRK System Tools XML-based Corpora Development
CLaRK is an XML-based software system for corpora development. It incorporates several technologies: XML technology; Unicode; Regular Cascaded Grammars; Constraints over XML Documents. The basic components of the system are: a tagger, a concordancer, an extractor, a grammar processor, a constraint engine.
Development of Corpora within the CLaRK System: The BulTreeBank Project Experience
CLaRK is an XML-based software system for corpora development. It incorporates several technologies: XML technology; Unicode; Regular Cascaded Grammars; Constraints over XML Documents. The basic components of the system are: a tagger, a concordancer, an extractor, a grammar processor, a constraint engine.
Ontology-based Access to heterogeneous XML Data
With the increase in popularity of XML on the Internet, the requirements of database management systems have shifted from traditional transaction-based databases towards the kind of characteristics provided, by design, by the Lightweight Directory Access Protocol. At the same time, the design and use of a middleware to provide a common querying interface to XML-based systems has become an incre...
Descriptor-based spatialization
The translation of monophonic soundtracks to new audio formats is the object of a growing demand particularly from the DVD producers. However operations like “upmixing” a monophonic track to a multi-channel format are time-consuming tasks for the sound engineer, who has to choose, adapt and tune different spatialization tools. In order to simplify the upmix, we introduce a new spatialization ap...
XML Topic Map Builder: Specification and Generation
Every day, thousands of new information resources are linked to the web. The web is therefore growing very fast, which makes search tasks more difficult. To solve the problem, several initiatives were undertaken and a new area of research and development emerged: the one called Semantic Web. When we refer to the semantic web we are thinking about a network of concepts. Each concept has a group of rel...
Publication date: 2003